o 



Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 22 July 2011 (MN JAT^ style file v2.2) 

Convergence of Galaxy Properties with Merger Tree 
Temporal Resolution 

Andrew J. Benson^*, Stefano Borgani^'^''^, Gabriella De Lucia^, Michael Boylan-Kolchin^ 
&; Pierluigi Monaco^'^ 

1 California Institute of Technology, MC350-17, 1200 E. California Blvd., Pasadena, CA 91125, U.S.A. 

^ Dipartimento di Fisica dell'Universita, Sezione de Astronomia, via Tiepolo 11, 1-34131 Trieste, Italy 

^ INAF — Isservatorio Astronomico di Trieste, via Tiepolo 11, 1-34-131, Trieste, Italy 

^ INFN — Istituto Nazionale di Fisica Nucleare, Trieste, Italy 

^ Department of Physics and Astronomy, Center for Cosmology, University of California, 4129 Reines Hall, Irvine, CA 92697, USA 



o 

CN 22 July 2011 



o 
u 

6 



> 

00 
ON 
O 
^. 

o 



X 



ABSTRACT 

Dark matter halo merger trees are now routinely extracted from cosmological simula- 
tions of structure formation. These trees are frequently used as inputs to semi-analytic 
models of galaxy formation to provide the backbone within which galaxy formation 
takes place. By necessity, these merger trees are constructed from a finite set of dis- 
crete "snapshots" of the N-body simulation and so have a limited temporal resolution. 
To date, there has been little consideration of how this temporal resolution affects the 
properties of galaxies formed within these trees. In particular, the question of how 
many snapshots are needed to achieve convergence in galaxy properties has not be an- 
swered. Therefore, we study the convergence in the stellar and total baryonic masses 
of galaxies, distribution of merger times, stellar mass functions and star formation 
rates in the Galacticus model of galaxy formation as a function of the number 
of "snapshot" times used to represent dark matter halo merger trees. When utilizing 
snapshots between z = 20 and z = 0, we find that at least 128 snapshots are required 
to achieve convergence to within 5% for galaxy masses. This convergence is obtained 
for mean quantities averaged over large samples of galaxies — significant variance for 
individual galaxies remains even when using very large numbers of snapshots. We find 
only weak dependence of the rate of convergence on the distribution of snapshots in 
time — snapshots spaced uniformly in the expansion factor, uniformly in the logarithm 
of expansion factor or uniformly in the logarithm of critical overdensity for collapse 
work equally well in almost all cases. We provide input parameters to Galacticus 
which allow this type of convergence study to be tuned to other simulations and to be 
carried out for other galaxy properties. 
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1 INTRODUCTION 



Simulations of the cosmological evolution of large scale 
structure and individual galaxies in cold dark matter cos- 
mogonies (Springel ot al. 2005; Kuhlcn ct al.|2008[ Springel] 



et al.|[20C)8, . Boylan-Kolchin et al.||2009i .Klypin et al.||20i^ 



Prada et al. 12011 1 are an invaluable tool in studying the for- 



mation and evolution of galaxies. With the high resolutions 
and large volumes attained by modern simulations, a par- 
ticularly useful approach is to extract information on indi- 
vidual dark matter halos and how those halos merge during 
the process of hierarchical structure formation. It is within 
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these merging hierarchies of dark matter halos that galaxies 
form. Typically, this information on hierarchical growth is 
encoded as a "merger tree" represented by a set of "nodes" 
(each corresponding to a dark matter halo at one point in 
its evolution) and "branches" connecting those nodes which 
link descendant halos to their progenitors (i.e. halos which 
will merge together to form the descendant). While these 
merger trees have many applications, a very common use is 
an input to models which aim to solve the physics of galaxy 
formation within the merging hierarchy of dark matter — in 



particular the class known as "semi-analytic models" (Cole 



fc Blaizot|2007[ [Monaco et al.|2007[ |Somerville et al.|2008 i' 
Due to the way N-body simulations are analyzed and 



et al.|2000||Hatton et al.|2003l[Croton et al.|2'006l|De Lucia 
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their data stored, merger trees are usually constructed from 
a set of "snapshots" — outputs of all of the particle data at a 
set of A*' times, ti where z = 1 to A'^. Since A'" is finite (and 
often limited by available storage and processing power) this 
means that each merger tree is a temporally sparse represen- 
tation of some underlying true merger tree which may have 
structure on shorter timescales. Such structure is lost in the 
sparse representation. The question of how this loss of infor- 
mation affects the properties of galaxies produced by models 
utilizing sparse merger trees has been touched upon before, 



but never carefully studied. For example, Helly et al. ( 2003 1, 



utilizing merger trees with 44 snapshots (spaced uniformly 
in the logarithm of expansion factor) from z = to z = 20, 
compared their results to those obtained from their stan- 
dard Galform model based on merger trees constructed 
from Press-Schechter techniques using a significantly larger 
number of snapshots and noted that "... we find that if we 
degrade the time resolution of the standard Galform model 
to match that of our N-body model the properties of the 
galaxy populations predicted change very little." However, 
this statement refers only to statistical properties of the 
entire galaxy population rather than to individual galax- 
ies and, in any case, no details were given. |Hatton et al.| 
( 2003 1 utilized merger trees with around 70 snapshots be- 
tween 2 = and z = 10. They repeated their calculations 
using only every second snapshot and found that this leads 
to a 20% scatter in cold gas content in galaxies, and a 40% 
scatter in the mass of hot gas associated with each galaxy. 
They concluded that this was not a significant problem, but 
this is arguably a significant discrepancy given the accu- 



racy of modern observational datasets. Croton et al. (20061 



describe a galaxy formation model utilizing merger trees ex- 
tracted from the Millennium Simulation with 60 snapshots 
between z = Q and z = 2Q but do not discuss if this number 
is sufficient to achieve converged results. |Somerville et alT] 
( |2008[ ) present a model based on merger trees constructed 
using an extended Press-Schechter algorithm and note that 
they find no significant changes if they instead implement 
their model in merger trees extracted from N-body simula- 
tions, but do not provide further details. 

Importantly, none of these studies were able to address 
the issue of how fast galaxy properties converge as the num- 
ber of snapshots is increased, or what is the optimal distribu- 
tion of snapshots in time. Additionally, in all of the galaxy 
formation models employed at least some of the baryonic 
physics (e.g. the rate of gas cooling) is solved using timesteps 
tied to the snapshot spacing — this is non-optimal as it is in 
principle possible to obtain better convergence by solving 
the baryonic physics on a much shorter timescale (and in- 
terpolating the dark matter halo properties as necessary). 

In this work, we address these issues by utilizing the 
Galacticus semi-analytic galaxy formation code ( |Benson| 
2011 1. This code has two key features which make it ideally 



suited to this problem. The first is its ability to construct 
merger trees using a modified extended Press-Schechter for- 



malism ( Parkinson et al. 2008 1 which both agree with the 



statistics of merger trees extracted from N-body simulations 
and which can be built without reference to any fixed grid 
of snapshot times, thereby achieving arbitrarily high tem- 
poral resolution. As we will describe below, Galacticus is 
able to post-process these trees to construct sparse repre- 
sentations in the same manner as trees extracted from an 



N-body simulation. The second key feature of Galacticus 
is that all baryonic physics is solved using adaptive timesteps 
which are adjusted to keep a specified tolerance in the quan- 
tities being computed. As such, the number and distribu- 
tion of snapshots in the merger trees affects the results from 
Galacticus only due to the inevitable information loss re- 
sulting from the sparse representation. We exploit these fea- 
tures to explore convergence of basic galaxy properties as a 
function of the number of snapshots available and the dis- 
tribution of those snapshots in time. 

The remainder of this paper is arranged as follows. In 
32] we describe the process by which we construct merger 
trees and alter their temporal resolution. In Owe present 
results from our convergence study and in pTwe give our 
conclusions. 



2 METHOD 

We build merger trees using Gal acticus's modified ex- 
tended Press-Schechter algorithrrrj ( [Parkinson et aL 2008 
Benson 20111. This algorithm makes no reference to any 



fixed timesteps, but instead steps along each branch of 
the tree using a step that is controlled by three pa- 
rameters. The first parameter, called ei by IParkinson] 
et al. ( 2008 1 and corresponding to the input param- 
eter [modif iedPressSchechterFirstOrderAccuracy] in 
Galacticus, ensures that the merger rate equation de- 
rived 



by 
curate. The 



Parkinson et al. ( 2008 1 is first order ac 
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second parameter, called £2 by |Parkin-| 
( 2008 1 and corresponding to the input pa- 



rameter [mergerTreeBuildCole2000MergeProbability] in 
Galacticus, limits the probability that an above-resolution 
binary split occurs during the step. The final parameter, 
which we will label e^ and which corresponds to the input 
parameter [mergerTreeBuildCole2000AccretionLiniit] in 
Galacticus, limits the maximum fractional amount of 
smooth accretion (i.e. sub-resolution merging) over the 
stepPl We typically set the values of these parameter to 0.1 
(as do I Parkinson et aL ( 2008 \ for ei and £2) but have checked 
that reducing them by an order of magnitude does not affect 
the galaxy properties that result when using these trees (see 



S2.ll 



Having built trees, we use the Galacticus task 
mergerTreeRegridTimes to force the trees onto a pre- 
specified time grid — a process that we will refer to as "re- 
gridding". This process works as follows. We begin by defin- 
ing a set of times, ti with i = 1 to A', which would correspond 
to the snapshots of an N-body simulation. Given a merger 
tree, we walk along each branch and look for branches (i.e. 
connections between nodes) which span one or more of these 
times. When such a branch is found, we interpolate the mass 
of the halo along the branch if this is a primary progeni- 
tor (otherwise we keep the mass of the halo fixed along the 



be found 

available 



in 
online 



the 
at 



^ Full details can 

Galacticus manual 

'http://wM w.ctcp.c al tech .edu7galacticus/Galactlcus_v0.9.0.pdf I 
^ Parkinson et al.l (|2008|l do not include this constraint when 



setting timesteps during tree building. We include it to ensure 
that the mass vs. time relation is sufficiently well resolved along 
each branch in the regime of smooth accretion. 
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branch) to each intersected time and create a new node at 
that time with the interpolated masa^ Having done this for 
every branch of the tree we then walk the tree again re- 
moving all nodes which do not fall on one of the snapshot 
times. When removing a node, we connect any children to 
the node's parenlrl In this way we preserve the merging 
structure of the tree. Figure [T] shows a representation of a 
typical merger tree re-gridded in this way. Circles represent 
nodes in the tree with lines showing the branches. Circle 
size is proportional to the logarithm of halo mass and time 
runs down the page on a logarithmic axis (such that the 
present day is at the bottom of the figure). Open symbols 
show nodes in the original merger tree, built without ref- 
erence to any set of snapshot times. Dotted lines show the 
branches in this tree. Filled circles show interpolated nodes 
added to the tree at each snapshot time, with solid lines in- 
dicating the branches in the resulting sparse tree. Note that 
in the original tree only binary mergers occur by construc- 
tion (indicated by filled dots in the centres of open circles 
in the right-hand panel) , while in the sparse tree non-binary 
mergers can occur. 

We consider three different options for setting the snap- 
shot times. In each case, t\ corresponds to 1 -I- z = 20 while 
In corresponds to 1 -\- z = 1. We then consider cases where 
snapshots are uniformly spaced in expansion factor, a, are 
uniformly spaced in the logarithm of expansion factor, ln(a), 
and are uniformly spaced in the logarithm of the critical lin- 
ear theory overdensity for collapse. In Jc, as this is a natural 



timescale for structure formation (see Benson et al. 2005 1 



Examples of the resulting distribution of snapshot times are 
shown in Fig. [2] Note that the "uniform in In a" and "uni- 
form in In (5c" distributions are quite similar due to the fact 
that (5c oc 1/a at high redshifts. In each case we consider 
A'^ = 16, 32, 64, 128 and 256 to explore increasingly well- 
resolved trees. 

For each set of snapshot times we use Galacticus to 
compute the properties of galaxies forming in a large number 
of realizations of merger trees spanning the mass range 10^^ 
to 10^'^ Mq. In each case we also run a model in which no re- 
gridding of the merger trees occurs — we will use this as our 
reference point corresponding to A'^ — > oo. For specificity 
we adopt cosmological parameters corresponding to the 
WMAP-7 dataset (^o = 0.2725, ^a = 0.7275, fib = 0.0455, 
as =0.807 , Hq = 7 0.2, n^ = 0.961, A^eff = 4.34; |Komatsu| 
et al.|2011 l, use the Eisenstein & Hu (19991 fitting formula 
for the cold dark matter transfer function and a merger tree 
mass resolution (i.e. the lowest mass halo which is tracked 
in the merger tree) of 1.2 x IO^Mq. This specific mass res- 
olution was chosen to match that used in work currently in 
preparation, but corresponds to a moderately well resolved 



^ When constructing trees using a modified extended Press- 
Schechter algorithm the primary progenitor is identified as the 
most massive progenitor at each bifurcation of the merger tree. 
When solving galaxy formation physics, Galacticus performs 
the same interpolation of halo mass along each branch to provide 
a smooth evolution. Thus, this interpolation of the merger trees 
is consistent with the way in which galaxies and halos are evolved 
in Galacticus. 

■* By "children" we mean direct progenitors of a halo existing at 
an earlier time and by "parent" we mean the halo into which the 
halo in question will merge in the future. 
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Figure 2. The snapshot times used in this work when beginning 
snapshots at 2 = 20. Colours correspond to different snapshot 
distribution choices as indicated in the figure. Larger points cor- 
respond to the snapshots used in coarser re-griddings. 



halo (i.e. a little over 100 particles) in the recent Millennium- 
II simulation { Boylan-Kolchin et al.|2009 1 . We consider two 
sets of input parameters to the Galacticus model. In the 
first, which we will refer to as the "full" model, we employ 



the full set of galaxy formation physics (see Benson 20111 



with parameters chosen to provide good fits (Benson 2011, 
in preparation) to the local galaxy stellar mass funct ion ([Li 
fc White||2009 ), the local galaxy HI mass function (Zwaan 
et al.||2005 \ and the star formation history of the Universe 
( Hopkins||2004 1 . For the second set of input parameters we 
consider a simplified model, which we refer to as the "sim- 
ple" model, in which we switch off all feedback effects due 
to supernovae and AGN (and thereby prevent any outfiows 
of mass from galaxies) but leave all other parameters un- 
changed. 



2.1 Convergence in Tree Building and Baryonic 
Solver 

To reliably test the convergence of galaxy properties with 
respect to the temporal resolution of merger trees we must 
first confirm that the galaxy properties are converged with 
respect to other numerical parameters in Galacticus. In 
particular, there are two separate issues which must be con- 
sidered. The first issue is the numerical accuracy with which 



the TV. 



step 



oo trees are built. The second issue is the accu- 



racy with which the ordinary differential equations (ODEs) 
describing baryonic physics are solved. 

Figure [3] demonstrates the convergence of galaxy prop- 
erties with parameters controlling these aspects of Galacti- 
cus in A^stop = oo merger trees. We compare our stan- 
dard calculation, computed with ei = e2 = £3 = 0.1 
(which control the accuracy of merger tree construction) 
and rtoi = atoi = 0.01 (which control the tolerance in the 
ODE solver used for evolving baryonic physics and which 
correspond to input parameters [odeToler2mceRelative] 
and [odeToleranceAbsolute] in Galacticus) with calcu- 
lations in which these two sets of parameters are decreased 
by a factor of 10. For the simple model, the results are clearly 
extremely well converged with respect to these parameters. 
For the full model the scatter on the mean values are larger 



4 Benson et al. 





Figure 1. An example of a merger tree interpolated onto a fixed grid of timesteps. Circles represent nodes (halos) in the merger tree, 
with radius proportional to the logarithm of halo mass. Logarithmic time runs down the page with z = at the bottom of the figure. 
Open symbols and dotted lines represent nodes in the original tree and their connecting branches, while filled symbols and solid lines 
show nodes in the re-gridded tree and their connecting branches. The left view shows the full tree, while the right view shows a zoom in 
to the final five snapshot times in the same tree. In the zoomed view, nodes in the original tree which have two progenitors (i.e. are the 
result of a merger) are indicated by a filled dot in the center of the corresponding open circle. 
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than in the simple model. This is not surprising as the ad- 
ditional physics of outflows included in the full model leads 
to a greater dependence of baryonic properties on details of 
the tree formation history. Nevertheless, given the size of 
the error bars the results are consistent with the trees and 
ODE solver being converged with respect to these numerical 
parameters. One possible marginal exception is the case of 
the baryonic masaj of central galaxies at z = 1 in the full 
model — there are a few points which deviate significantly, 
which could suggest that the trees are not sufficiently ac- 
curate for this model at high-z. The effect is marginal, but 
should be kept in mind when considering convergence in 
properties of high-z galaxies in the full model. 



3 RESULTS 

We now compare results obtained from A'^stcp = oo trees 
with those from trees with finite A'^stcp and assess the degree 
of convergence as a function of A'stcp- 



3.1 Individual Galaxies 

We begin by considering the evolution of individual galax- 
ies, chosen to be the main progenitors of the central galaxy 
of each z = halo. Figure |4] shows the stellar and total 
baryonic masses as a function of time for two representative 
galaxies (corresponding to 2; = halos with masses in the 
range lO^'^-lO^^M©) from the simple and full models. Line 
colours indicate the different values of A'stop used. Consid- 
ering the simple model first, it is clear that convergence is 
reached quite rapidly — while the evolution for A'^step = 16 is 
significantly different from the A'^stcp = 00 case, A'stop = 32 
is sufficient to reproduce the evolution with reasonable ac- 
curacy and A'stop 5S 128 results in a faithful recreation of the 
A'stop = 00 result. 

The full model shows a rather different behaviour. Con- 
vergence is generally slower and, more importantly, there are 
cases where even for A'stop = 256 the evolution can diverge 
substantially from the A^stop = 00 case. The reason for this 
is that the evolution of a given galaxy can depend crucially 
on individual events. For example, a change in the structure 
of the merger tree due to regridding could cause a galaxy 
merger event to change from being considered "minor" to 
being "major". Given the rules adopted by semi-analytic 
models this can result in very different properties for the 
galaxy emerging from the merger event and, therefore, to 
dramatically different evolution at later times. This can be 
seen clearly in the lower right panel of Fig. [4] in which the 
A'stop = 256 case diverges from the other models very early 
and consequently differs by a factor of 3 in mass compared 
to the A'stop = 00 case at 2; = 0, while even the A'stop = 16 
model gets very close to the evolution and final mass of the 
A'stop = 00 case. 



Fortunately, semi-analytic models are not usually in the 



^ Following the usual logic of semi-analytic models, the baryonic 
mass of each galaxy is taken to be the mass of stars plus any 
mass in cold interstellar medium gas. Hot gas (with a temperature 
approximately equal to the virial temperature of the halo) and 
gas in cold flows is instead associated with the halo and is not 
included in the mass of the galaxy as discussed here. 



business of predicting the properties of individual galaxieaM 
but rather in making statistical predictions for populations 
of galaxies. Divergences due to temporal regridding such as 
are seen in Fig. H] should cancel to some extent when aver- 
aged over many galaxies. In the following sections we will 
assess how effective this averaging is by considering the con- 
vergence of average properties of galaxies. 



3.2 Statistical Comparison 

3.2.1 Stellar and Baryonic Masses 

Figure [5] shows results for the convergence in the average to- 
tal (gaseous plus stellar) mass of central galaxies in bins of 
dark matter halos mass for both simplified and full models 
with snapshot spacings uniform in a, In a and In (5c. Symbol 
colours correspond to different numbers of snapshots as in- 
dicated in each panel. In each case we plot the ratio of the 
mean mass of central galaxies to that found in the A' — >■ 00 
calculations. Error bars on the points are an estimate of the 
error on the mean due to the finite number of merger trees 
realized in each mass birPl For the simple models, it can be 
seen that in all cases convergence is fastest for lower mass 
halos, and gets progressively worse for higher mass halos. 
This could be due to the fact that low mass halos are poorly 
resolved anyway (due to the finite mass resolution imposed 
on the trees) and so re-gridding does not lose significant in- 
formation, whereas high mass halos are well resolved and 
their merger trees contain substantial information (i.e. they 
have much "richer" formation histories) which is lost by re- 
gridding^ Alternatively, it may simply be that the baryonic 



^ One exception to this is the study of ptringer et al.| |2010| who 
follow the formation of a galaxy in a merger tree extracted from an 
N-body+hydrodynamical simulation and compare its properties 
and evolution to the same galaxy found in the same individual 
merger tree in the simulation. That study used 49 timesteps and 
found very good agreement between the semi-analytic and sim- 
ulated galaxy properties. Based on the results obtained here, we 
would predict that for a significant fraction of trees similar stud- 
ies would find quite divergent results between the two techniques. 
Another class of exception is in applying semi-analytic techniques 
to high resolution simulations of individual halos, such as the 
Aquarius (j Springel et al.||2008[ l and Via Lactea ( |Kuhlen et al.| 
]2008) simulations. 

' in general, this error is largest in the lower mass halos — a con- 
sequence of the greater dispersion in galaxy properties in these 
halos. In massive halos, central galaxies form through multiple 
mergers which results in an averaging that reduces the scatter 
in their properties, while when we consider all galaxies in the 
halo the dispersion is reduced in massive halos simply because we 
average over many more galaxies. 

* It is not obvious how this loss of information is best quantified. 
Information is stored in both the structure of the connected tree 
and in the labels (mass and time) associated with each node. Sim- 
ply estimating the information content based on the number of 
bits required to store a tree is misleading, as the labels associated 
with nodes are not entirely independent (i.e. along a given branch 
the mass changes in a smooth and predictable way between bifur- 
cations). One approach is to consider just the connected structure 
of the tree, ignoring the mass and time information associated 
with each node. In that case, methods that have been developed 
for analyzing phylogenetic trees can be adopted. For example, the 
cladistic information content, CIC, provides a useful measure of 
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Figure 3. Convergence of the mean total baryonic mass of galaxies as a function of halo mass with respect to other numerical parameters 
in Galacticus. Panels in the left column correspond to the simple model, while those in the right column correspond to the full model. 
Rows correspond to central galaxies at 2 = 0, all galaxies at 2 = and central galaxies at z = 1 from top to bottom. In all cases, the 
earliest snapshot is at 1-1-2 = 20 and the final snapshot at 1-1-2 = 1. Symbol colour corresponds to the parameter values used, while error 
bars indicate the error on the mean mass due to the finite number of merger trees realized in each bin. Points in each mass bin are given 
small horizontal offsets for clarity. The parameters ei, e2 and e-j, control the accuracy with which the A'step = oo trees are built. Reducing 
their values results in more accurate tree construction and clearly shows that the Nstep = oo trees are well converged with respect to these 
parameters. The parameters r^oi and atoi (corresponding to input parameters [odeToleranceRelative] and [odeToleranceAbsolute] in 
Galacticus) control the accuracy with which baryonic physics is solved in Galacticus. Again, reducing their values results in more 
accurate solutions and shows that the results are sufficiently converged for this study. 
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Figure 4. The evolution of the stellar (dashed lines) and total baryonic (solid lines) mass in the main progenitor galaxy in representative 
merger trees is shown from z ^ 6 to z = 0. Colours indicate the number of snapshot times used in each calculation. The left column 
shows results for the simple model, while the right panel shows results for the full model. 



physics (e.g. cooling) is more sensitive to the details of the 
merger tree in higher mass systems. We have tested these 
scenarios by re-running the higher mass merger trees using 
a mass resolution of 1.2 x IO^^A/q (i.e. a mass 100 times 
larger than in our standard cases). We find that this results 
in even slower convergence, suggesting that it is the baryonic 
physics that matters and not the amount of information in 
the merger trees themselves. For the full models convergence 
is worst for intermediate mass halos, becoming more rapid 
in the highest mass systems. 

In all cases, convergence to within 5% across all masses 
requires A^stop = 128. Convergence occurs at very similar 
rates irrespective of the choice made for the distribution of 
snapshots in time. Snapshots distributed uniformly in the 
logarithm of expansion factor or critical overdensity for col- 
lapse result in very marginally faster convergence in some 



the information content of a sparse tree which is assumed to rep- 
resent an underlying binary tree jThorley et al.||l998[ l. For the 
trees constructed in this work prior to any re-gridding a IO^'^Mq 
tree has a CIC of approximately 14 bits while a 10^'^Mq tree 
has a CIC of around 5000 bits. Re-gridding these trees onto 32 
snapshots spaced uniformly in the logarithm of expansion factor 
results in an information loss of around 2 and 800 bits respec- 
tively. 



cases, but this seems to be a small effect. Given the (ex- 
pected) similarity in the results for spacings uniform in ln(a) 
and ln{Sc) we will not show results for spacing uniform in 
ln(5c) in subsequent figures. 

At intermediate halo masses, convergence seems to 
occur from opposite directions in the simple and full 
models. Specifically, models with low A'^stcp systematically 
under(over)-predict the true mass in the simple(full) model. 
The effects of supernovae-driven outfiows, present in the full 
model but not the simple model, are ultimately responsible 
for this difference. In the simple model, low mass dark mat- 
ter halos are able to efficiently accrete and cool gas into 
the galaxy phase and then later merge with central galax- 
ies. In low A^stop trees, many of these halos are missed (they 
form and are subsumed by larger halos between successive 
timesteps) and so do not form galaxies. This reduces the 
mass brought in to central galaxies through merging, caus- 
ing their mass to be underestimated. In the full model, the 
supernovae-driven outflows eject most of the mass which 
condenses into low mass halos, reducing their mass and their 
subsequent contribution to the growth of central galaxies. 

Figure [6] shows the same information but now for just 
the stellar masses of central galajcies at z = 0. In this figure, 
we show a second set of errorbars (indicated by thinner lines) 
which show the root variance (divided by a factor of 5 to 
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keep the error bars smaller than the scale of the y-axis) in 
the distribution of galaxy stellar masses (as opposed to the 
error on the mean which is much smaller). These relatively 
large dispersions in masses illustrate the need for averaging 
over many merger trees to obtain accurate estimates of the 
degree of convergence in the mean quantities. The rate of 
convergence is very similar to the case of total baryonic mass 
overall, and the same conclusions apply — A'atop = 128 is 
required for convergence to within 5% across the entire mass 
range. 

When we consider all galaxies (i.e. we sum the masses of 
all galaxies, satellites and centrals, in a halo and then find 
the mean of this quantity over many realizations) results 
change somewhat as shown in Fig. [7] and |8] For example, 
the systematic offset when A^stcp is small is in the opposite 
direction for high mass halos, now always over-predicting 
the mass. We find that A^stcp = 64 is sufficient for 5% con- 
vergence in total baryonic mass, while A'^ = 128 is required 
for the same degree of convergence in stellar mass. 

Figure [9] shows convergence in the stellar mass of cen- 
tral galaxies at z = 1 and 3, as a function of their halo mass 
at those redshifts. Note that the number of snapshots always 
refers to the total number from z = 20 to z = even when 
results are shown for 2 > 0. Errors grow rapidly with in- 
creasing redshift. Convergence is slower than for z = 0, par- 
ticular in the case of low mass halos at z = 3. The properties 
of galaxies at these redshifts depend only on the structure 
of their progenitor tree at yet higher redshifts, such that the 
number of relevant snapshots is significantly less than A'stop. 



3.2.2 Numbers of Galaxies 

Figure [lO] shows convergence in the number of viable sub- 
halos per isolated halo at different redshifts. An "isolated 
halo" is one which is not a substructure within a larger halo 
and so corresponds to the type of halo that might be found 
by a friends-of-friends algorithm in an N-body simulation. 
By a "viable subhalo" we mean any subhalo (including the 
main subhalo which hosts the central galaxy) which at some 
point in the merger tree was an isolated halo and so would 
have had the opportunity to potentially accrete gas from 
the intergalactic medium and form a galaxy. Whether or 
not such a subhalo actually would form a galaxy depends 
on the baryonic physics. Here we are simply assuming that 
any halo which was never isolated definitely would not form 
a galaxjPl Convergence at low redshift occurs at the same 
rate when timesteps spaced uniformly in a or ln(a) are used, 
while at high redshift timesteps spaced uniformly in ln(a) 
give faster convergence. This simply refiects the relative den- 
sity of timesteps at low and high redshifts under these two 
choices for timestep distribution. We find once again that 



^ Non- viable substructures could of course be detected in N-body 
simulations, using an appropriate substructure finding algorithm 
such as SuBFiND I jSpringel et al.|2001^ . Therefore, the results in 
Fig. |10| are not directly relevant to HOD models or abundance 
matching models that are based on subhalo finding algorithms. 
However, within the context of current semi-analytic models, such 
subhalos could not form a galaxy, and so these results are relevant 
to HOD models that are based on fits to results from semi-analytic 
models that used N-body-derived merger trees. 



128 steps are sufficient to achieve convergence in the num- 
ber of viable subhalos to better than 5% in all cases. Using 
64 steps can result in errors of up to 10-15% at higher red- 
shifts. 



3.2.3 Mass Functions and Star Formation Rates 

It is interesting to assess the convergence in statistics more 
closely related to observable quantities. In Fig. ITT] we show 
convergence in the stellar mass function at z = (upper 
panels) and the volume density of star formation rate as a 
function of redshift (lower panels) . 

The stellar mass functions show significant and system- 
atic offsets as a function of A^atep- In both simple and full 
models convergence is worse for lower mass galaxies, with 
A'atop ~ 128 in the simple model (A^stop = 64 in the full 
model) being required to ensure better than 10% conver- 
gence. 

For the star formation rate as a function of redshift we 
find significant and systematic offsets from the A'atop = oo 
case. In the simple model, using a finite number of timesteps 
results in an overestimate of the star formation rate at z = 
with a larger overestimate at z = 4-6. Using A^stcp = 128 
ensures convergence to better than 10% across all redshifts. 
In the full model, convergence is more, rapid with even 
A^atep = 32 getting close to 10% or better convergence at 
all redshifts considered. 



3.2.4 Major Merger Times 

Finally, we examine the convergence in the distribution of 
merging times. Specifically, we consider the time since the 
last major merger experienced by central galaxies as a func- 
tion of their halo mass. We compute the mean of this dis- 
tribution, excluding any central galaxies which never expe- 
rienced a major merger. Figure [12] shows the resulting con- 
vergence in this quantity as a function of A^stop at z = 
(left panel) and z = 1 (right panel) for the full model. For 
low values of A'^stcp offsets from the true value are clearly 
seen, but in each case A'atop = 64 is sufficient to achieve a 
converged answer. 



4 CONCLUSIONS 

It has become common practice to extract histories of the 
hierarchical merging process ("merger trees") from cosmo- 
logical N-body simulations and to use these as inputs to 
semi-analytic models of galaxy formation. Previously, there 
has been little consideration of how the temporal resolution 
of these trees affect the properties of the resulting galaxies. 
In this work we performed a convergence study using the 
Galacticus toolkit. 

We find that 128 snapshots spaced uniformly in the 
logarithm of expansion factor (or, almost equivalently, in 
the logarithm of the critical overdensity for collapse) pro- 
vide good (~ 10%) convergence in galaxy stellar and to- 
tal masses, the number of viable subhalos (i.e. those which 
have progenitors in the merger tree that are isolated, non- 
substructure halos), distributions of merger times, in stellar 
mass functions at z = and in the volume density of star 
formation rate as a function of redshift. Smaller numbers 



Convergence of Galaxy Properties with Merger Tree Temporal Resolution 9 



Simple model; Uniform in a from 1 + z = 20; Baryonic ma,ss; Central galaxies; z=0 



Full model; Uniform in a from 1 + z = 20; Baryonic ma,ss; Central galaxies; z=0 



1.4 



1.2 - 



S 1 - 



0.8 



0.6 






% \ 



. Nam = ' 



tfZ = 128 



256 « 



W... 



64 o 
^32 • 
: 16 O 



10' 



lO'-t 



10'- 10'" 

M|„i„[Mo] 
Simple model; Uniform in In(fl) from 1 + z = 20; Baryonic mass; Central galaxies; z=0 



1.4 - 



I 1.2 



S 1 



J. 0.8 - 



0.6 - 




10 



10" 



10'^ 10'- 

Mhi,io[Mo] 

Simple model; Uniform in \vi{5c) from 1 + z = 20; Baryonic mass; Central galaxies; z=0 





1.4 












■ 


i 


1.2 














ST 
















































a 




M 












^ 

~~~\ 
>" 


1 


«f 


•» 


^ 


^ 


• 


• 

<5 


^ 








* 




* 




i 


0.8 




• 




* 




• 




0.6 


• 



• 

9 






• 


• 



10" 



10'^ 10'3 

Mh>,i„[Mo] 



10'- 




10'- 10'-' 

M|„i„[Mo] 
Full model: Uniform in In(fl) from 1 -t- z = 20; Baryonic mass; Central galaxies; z=0 



1.4 


■ 


{ J 


I 1-2 

^ 1 

1' 


^ 


H <» n^ 


^0.8 
0.6 


Naicc = °° I 

N„,„ = 256 • 

W p = 128 • 

nL = 64 

^4 = 16 • 


♦ 



10" 



10" 



10'2 10'-' 

Mhi,io[Mo] 

Full model; Uniform in ln(Sc) from 1 -f z = 20; Baryonic mass; Central galaxies; z=0 




10'- 10" 

M|,i,io[Mo] 



Figure 5. Convergence with number of snapshots of the mean total baryonic mass in central galaxies at 2 = as a function of halo 
mass. Panels in the left column correspond to the simple model, while those in the right column correspond to the full model. Rows 
correspond to snapshots uniformly spaced in a, ln(a) and ln((5c) from top to bottom. In all cases, the earliest snapshot is at 1 -|- z = 20 
and the final snapshot at 1 -|- 2 = 1. Symbol colour corresponds to the number of snapshots used, while error bars indicate the error on 
the mean mass due to the finite number of merger trees realized in each bin. Points in each mass bin are given small horizontal offsets 
for clarity. 



of snapshots lead to rapidly diverging results and should 
be avoided. We also considered snapshots spaced uniformly 
in expansion factor. We find that no substantial difference 
in the number of timesteps required to reach a given de- 
gree of convergence using this distribution of snapshots. This 
convergence is obtained for mean quantities averaged over 



large samples of galaxies — the full model in particular shows 
significant variance for individual galaxies even when using 
very large numbers of snapshots. 

Our results should provide guidance as to how many 
snapshots should ideally be stored from future N-body simu- 
lations to ensure that the resulting temporally sparse merger 
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Figure 6. As Fig. [5] but for the stellar mass of central galaxies. Additionally, we show a second set of errorbars (indicated by thinner 
lines) which show the root variance (divided by a factor of 5 to keep the error bars smaller than the scale of the y-axis) in the distribution 
of galaxy stellar masses (as opposed to the error on the mean, shown by the thicker errorbars, which is much smaller). 



trees do not overly limit the accuracy of subsequent galaxy 
formation calculations. Our results are for a specific set of 
cosmological parameters and tree mass resolution, in ad- 
dition to being for a specific implementation of baryonic 
physics. The rate of convergence plausibly depends on all 
of these factors, and will likely differ for galaxy proper- 
ties other than those considered here. Since Galacticus is 
freely available as an open source projecIrM it is relatively 
easy for anyone to repeat the analysis performed here for a 
specific set of simulation parameters and galaxy properties. 
Our results were obtained with v0.9.0.r49f of Galacticus 
and we have made the input parameters available onlina_] 
Increasing mass resolution in simulations implies that 
merger trees contain more information. However, for this in- 
formation to be folded into semi-analytic model predictions, 
trees must be built with finer time-stepping. Thus, increas- 
ing mass resolution would imply increasing both the size of 



each snapshot and the number of such snapshots, thereby 
causing a "data tsunami". It would then be recommendable 
for large high resolution simulations to be post-processed 
on-the-fly, writing at finely spaced times only (sub)halo cat- 
alogues instead of the entire snapshot. 

Recent interest in exploring and constraining the pa- 
rameter space of semi-analytic galaxy formation models 



(Henriques et al. 2009 Bower et al. 2010 Lu et al. 20101 



makes it crucial to understand and control numerical inac- 
curacies in such codes. Otherwise, quantitative constraints 
on model parameters will be subject to unknown systematic 
biases. While many uncertainties remain in our understand- 
ing of the physics of galaxy formation, it is important to 
ensure that numerical results are converged. Considerations 
such as those described here should become a standard part 
of any galaxy formation study. 



^° Galacticus can be downloaded 

: //sites .google . com/site/galacticusmodel' 



from 



|http 



i'he input parameter tiles 
struct plots of the results ca 



and 
I be 



scripts to con- 
downloaded from 



http : //www . ctcp . caltech . edu/galacticus/parameters 



/dmTreeConvergence . tar . bz2| 



ACKNOWLEDGMENTS 

AJB acknowledges the support of the Gordon & Betty 
Moore Foundation. SB has been partially supported by the 
PD51 INFN grant and by the PRIN-INAF grant "Towards 
an Italian Network for Computational Cosmology". GDL ac- 
knowledges financial support from the European Research 



Convergence of Galaxy Properties with Merger Tree Temporal Resolution 11 



Simple model; Uniform in a from 1 + z = 20; Baryonic mass; All galaxies; z=0 



Full model; Uniform in a from \ + z = 20: Baryonic mass; All galaxies; z-0 



1.4 












■ 


i 1.2 

i 

i 1 
t 


- J 


J 


/ 


/ 


/ 


/ : 


to.8 
0.6 


KtZ = 128 


• 
« 



• 

8 








■ 



10" 



10'^ 10'-' 



10'- 



Simple model; Uniform in ln(a) from 1 -t- z = 20; Baryonic mass; All galaxies; z=0 



1.4 


■ 


I 1.2 




^ 1 

1 


JJJJJJ 


J 0.8 


Nstec = o° O 

N,^„ = 256 • 




Cp = 128 • 




£p = 64 


0.6 


^4 = 32 • 
^4 = 16 O 



10" 



10'^ 10': 

MwAolMo] 



10'- 



1.4 






■ 


I 1.2 




\ 




a" 

J" 
S 1 

a' 


^ % 




J ^ 


^a 0.8 
0.6 


Af.„p 1=256 • 

A^„p = 128 • 

Wsiep = 64 




. 



10" 



10'^ 10'-' 

A^haio[-Me] 



10'^ 



Full model; Uniform in ln(a) from 1 + s = 20; Baryonic mass; All galaxies; z=0 



1.4 








■ 


l>-2 

5^ 1 


: y 


\! 


< 


** «* «» 


^ 




i 






J. 0.8 
0.6 


'Vstep = oo 

|»P = 64 




• 
• 



• 
o 




■ 



10" 



10'2 10'^ 

jWhalo[A'o] 



10'^ 



Figure 7. As Fig. |5] but for the total mass of all galaxies. 



Council under the European Community's Seventh Frame- 
work Programme (FP7/2007-2013)/ERC grant agreement 
n. 202781. MBK acknowledges support from the Southern 
California Center for Galaxy Evolution, a multi-campus re- 
search program funded by the University of California Office 
of Research. We thank the KITP, Santa Barbara, where this 
work was begun, for their hospitality. This work made ex- 
tensive use of Amazon's Elastic Compute Cloud through a 
generous grant from the Amazon in Education program. 



REFERENCES 

Benson A. J., 2011. NewA, in press (arXiv: 1008. 1786) 
Benson A. J., Kamionkowski M., Hassani S. H., 2005, MN- 

RAS, 357, 847 
Bower R. C, Vernon I., Goldstein M., Benson A. J., Lacey 

C. G., Baugh C. M., Cole S., Frenk C. S., 2010, MNRAS, 

407, 2017 
Boylan-Kolchin M., Springel V., White S. D. M., Jenkins 

A., Lemson G., 2009, MNRAS, 398, 1150 
Cole S., Lacey C. G., Baugh C. M., Frenk C. S., 2000, 

MNRAS, 319, 168 
Croton D. J., Springel V., White S. D. M., De Lucia G., 

Frenk C. S., Gao L., Jenkins A., Kauffmann G., Navarro 

J. F., Yoshida N., 2006, MNRAS, 365, 11 
De Lucia G., Blaizot J., 2007, MNRAS, 375, 2 



Eisenstein D. J., Hu W., 1999, ApJ, 511, 5 

Hatton S., Devriendt J. E. G., Ninin S., Bouchet F. R., 

Guiderdoni B., Vibert D., 2003, MNRAS, 343, 75 
Helly J. C, Cole S., Frenk C. S., Baugh C. M., Benson A., 

Lacey C, 2003, MNRAS, 338, 903 
Henriques B. M. B., Thomas P. A., Oliver S., Roseboom 

I., 2009, MNRAS, 396, 535 
Hopkins A. M., 2004, Astrophysical Journal, 615, 209 
Klypin A., Trujillo-Gomez S., Primack J., 2010 
Komatsu E., Smith K. M., Dunkley J., Bennett C. L., Gold 

B., Hinshaw G., Jarosik N., Larson D., Nolta M. R., Page 

L., Spergel D. N., Halpern M., Hill R. S., Kogut A., Limon 

M., Meyer S. S., Odegard N., Tucker G. S., Weiland J. L., 

WoUack E., Wright E. L., 2011, ApJS, 192, 18 
Kuhlen M., Diemand J., Madau P., Zemp M., 2008, Journal 

of Physics Conference Series, 125, 2008 
Li C, White S. D. M., 2009, MNRAS, 398, 2177 
Lu Y., Mo H. J., Weinberg M. D., Katz N. S., 2010 
Monaco P., Fontanot F., Taffoni G., 2007, MNRAS, 375, 

1189 
Parkinson H., Cole S., Helly J., 2008, MNRAS, 383, 557 
Prada F., Klypin A. A., Cuesta A. J., Betancort-Rijo J. E., 

Primack J., 2011 
Somerville R. S., Hopkins P. F., Cox T. J., Robertson B. E., 

Hernquist L., 2008, MNRAS, 391, 481 
Springel V., Wang J., Vogelsberger M., Ludlow A., Jenkins 



12 Benson et al. 

Simple model; Uniform in a from 1 + z = 20; Stellar mass; All galaxies; z=0 
1.4 - 

I 1.2 - 



Full model; Uniform in a from 1 + z = 20; Stellar mass; All galaxies; z=0 



0.8 



0.6 



■ 








- 


^ 


; 


J 


J 




A'slep = °° 
Waep = 256 


• 
• 

o 

« 
o 






- 



1.4 














I 1.2 


i 


1 ■■ 


1 


1 


♦ 


♦ 


i 

s 

* 1 


f 


^ 


J! 


# 

o 


* 

^ 


* 

J 


:^ 


f 










^•0.8 


A'slep = o° 
Wslep = 256 




« 
• 










" 


0.6 


■ H^u 


• 
« 








- 



10" 10'- 10" 10" 10" 10'- 10" lo'-* 

Simple model; Unifomi in ln(a) from 1 + z = 20; Stellar inass; All galaxies; z=0 Full model; Uniform in in(a) from 1 + z = 20; Stellar mass; All galaxies; z=0 



1.4 - 



'L 1.2 



0.8 - 



0.6 - 



■ 




- 


^ J 


J 


1 ♦ * 


NsKV = °° 

WsMp = 256 • 

nL = 64 

A'4 = 32 • 

^4 = 16 O 




: 



1.4 - 



'L 1.2 



S 1 



a 0.8 - 



0.6 - 



10" 



10'- 10'- 



■ 


\ * 


♦ ♦ 


^ 


♦// 


^ J . 


Ar.»p 1=256 •! 

iv4 = 128 • 

nL = 64 

n4 = 32 m 




- 



10'- 



10" 



10'- 10'- 



10" 



Figure 8. As Fig. [s] but for the stellar mass of all galaxies. 
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Simple model; Uniform in a from 1 + z = 20; Number; All galaxies; z=3 
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Simple model; Uniform in ln(a) from 1 + z - 20; Number; All galaxies; z=0 



1.4 












■ 


§ '■' 












- 


i 














^ 1 

i 


d* 


*^ 


■^ 


i 


*4 

* 


* 


SS 0.8 

0.6 


"step = 0° 

WsKp = 256 
n4 = 128 


• 
• 

• 
3 








■ 



10" 



10'- 



10'^ 10'^ 

Mhalo[Mo] 

Simple model; Uniform in In(fl) from 1 + z = 20; Number; All galaxies; z=l 



1.4 












■ 


8 1-2 












. 


a! 














^ 1 


* 

t 


« 

• 
• 

• 
3 


% 


% 


% 


% 


s: 0.8 

0.6 


« 

NstCD = °° 

WsKp = 256 

K^'=p = ?i 

A^sKp = 16 


* 
• 


* 
« 


« 
* 


• 



10" 



10'- 10'- 



10'^ 



Simple model; Uniform in In(fl) from 1 + z = 20; Number; All galaxies; z=3 
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Figure 10. As Fig. [s] but for the number of viable subhalos per isolated halo. The left-hand column use snapshots uniformly spaced in 
a, while the right-hand column has snapshots uniformly spaced in ln(a). Rows indicate results for 2 = 0, 1 and 3 (from top to bottom). 
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Figure 11. Convergence with number of snapshots of the stellar mass function of galaxies at z = (upper panels), and the star formation 
history as a function of redshift (lower panels). Panels in the left column correspond to the simple model, while those in the right column 
correspond to the full model. In all cases snapshots are uniformly spaced in ln(a), and the earliest snapshot is at 1 + 2 = 20 with the 
final snapshot at 1 + z = 1. Symbol/line colour corresponds to the number of snapshots used. In the stellar mass function panels, error 
bars indicate the error on the mean mass due to the finite number of merger trees realized in each bin and points in each mass bin are 
given small horizontal offsets for clarity. 
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Figure 12. Convergence with number of snapshots of the times since the last major merger for central galaxies at z = 0. The left panel 
corresponds to the simple model, while the right panel corresponds to the full model. Snapshots are uniformly spaced in ln(a), and the 
earliest snapshot is at 1 + z = 20 with the final snapshot at 1 + z = 1. Symbol colour corresponds to the number of snapshots used. Error 
bars indicate the error on the mean mass due to the finite number of merger trees realized in each bin. Points in each mass bin are given 
small horizontal offsets for clarity. 



